TinyViT: Fast Pretraining Distillation for Small Vision Transformers

نویسندگان

چکیده

Vision transformer (ViT) recently has drawn great attention in computer vision due to its remarkable model capability. However, most prevailing ViT models suffer from huge number of parameters, restricting their applicability on devices with limited resources. To alleviate this issue, we propose TinyViT, a new family tiny and efficient small transformers pretrained large-scale datasets our proposed fast distillation framework. The central idea is transfer knowledge large ones, while enabling get the dividends massive pretraining data. More specifically, apply during for transfer. logits teacher are sparsified stored disk advance save memory cost computation overheads. student automatically scaled down parameter constraints. Comprehensive experiments demonstrate efficacy TinyViT. It achieves top-1 accuracy 84.8% ImageNet-1k only 21M being comparable Swin-B ImageNet-21k using 4.2 times fewer parameters. Moreover, increasing image resolutions, TinyViT can reach 86.5% accuracy, slightly better than Swin-L 11% Last but not least, good ability various downstream tasks. Code available at https://github.com/microsoft/Cream/tree/main/TinyViT .

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Fast Method for Calculation of Transformers Leakage Reactance Using Energy Technique

Energy technique procedure for computing the leakage reactance in transformers is presented. This method is very efficient compared with the use of flux element and image technique and is also remarkably accurate. Examples of calculated leakage inductances and the short circuit impedance are given for illustration. For validation, the results are compared with the results obtained using practic...

متن کامل

Deriving fast distillation models: diploma thesis proposal

Distillation is the most important separation technology today. A trend in controlling distillation columns economically efficient is going toward using model predictive control (MPC) algorithms, which calculate an optimal input trajectory to the plant based on repeated simulations of a model of the process to predict the future behaviour of the plant with respect to disturbances and control in...

متن کامل

Fast and Accurate Robot Vision for Vision Based Motion

This paper describes the vision module from the soccer playing robots of the Dutch Team. Fast vision is necessary to get a close coupling with the motion software in order to allow fast turning and dribbling with the ball without loosing it. Accurate vision is necessary for the determination of the robot's position in the field and the accurate estimation of the ball position. Both fast and acc...

متن کامل

Fast Onboard Stereo Vision for UAVs

In the last decade researchers have built incredible new capabilities for small aircraft, with quadrotors moving from labs to toy stores and with autonomy reaching smaller and smaller vehicles. As the systems, and their payload capacities shrink, we can no longer use typical aircraft sensors such as RADAR, scanning LIDAR, and other active sensing methods for obstacle detection and avoidance. Sm...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2022

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-19803-8_5